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q \ Abstract 

The experimental errors in current and future hadron colliders are expected to 
decrease to a level that will challenge the uncertainties in the theoretical calculations. 
One important component in the prediction uncertainties comes from the Parton Den- 
sity Functions of the (anti)proton. In this work we develop an interface from the Les 
Houches Accord Parton Density Functions (LHAPDF) package to the very popular 
Monte Carlo generator PYTHIA version 6.2. Then we proceed to estimate the PDF 
uncertainties for the production of Drell-Yan pairs from the Z pole to masses above 
1 TeV and for Higgs bosons at LHC. The measurement of the electro- weak mixing 
$_i 1 angle at LHC as a particularly difficult case is studied. 
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1 LHAPDF Interface to PYTHIA 6.2 

A hadron collider can be viewed as a wide band parton beam machine. To make predic- 
tions, the parton cross sections are folded with the parton density functions (PDF): 

^^[pp - x] ~ j: + (i « j)) * (i) 

a - cross section for the partonic subprocess ij — > X 
xi, x 2 - parton momentum fractions, 

fi/ p (p) (xi) - probability to find a parton i with momentum fraction x,- L in the (anti)proton. 

1 For further informations, see |http : / /www . ippp . dur . ac . uk/^qeorq/ lhclc/| . For questions 
and comments, please contact Georg . Weigleingdurham .ac.uk . 
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A long standing problem when performing such calculations is what is the uncertainty 
of the results coming from our limited knowledge of the PDFs, even if the parton cross 
section a is known very precisely The constantly increasing computing power allows to 
address this problem in new ways. 

The Les Houches Accord Parton Density Function interface was conceived at the Les 
Houches 2001 workshop in the PDF working group to enable the usage of Parton Density 
Functions with uncertainties in a uniform manner (TJ. The LHAPDF code is designed 
to work with PDF sets. In this approach a "fit" to the data is no longer described by a 
single PDF, but by a PDF set consisting of many individual PDF members. Calculating an 
observable for all the PDF members (by repeating the Monte Carlo generation N times, 
consuming lots of CPU time), one can estimate the uncertainty on the observable coming 
from the limits in our PDF knowledge. The LHAPDF code was built with this in mind 
and manipulates PDF sets. 

Some sets available in LHAPDF are: 

• CTEQ6 |2J with 40 members: uses 2 • Npdf eigenvectors for twenty PDF parameters, 
selected to represent the PDF variation; to estimate an uncertainty corresponding to 
~ 90 % CL the following sum is constructed: 

NpD F 

E [MS+) - MS-)} 2 

2 = 1 

• MRST2001 El with 2 members 

• Fermi2002 |4| with 100(1000) members: standard statistical procedures are used to 
estimate the standard deviation from the set 

• two Q 2 evolution codes: EVLCTEQ vl for CTEQ6 and QCDNUM v 16.12 for all 
other sets. 

The interface developed by the author 2 enables the use of modern PDFs, as provided 
by the Les Houches Accord Parton Density Functions (LHAPDF) package, from the very 
popular Monte Carlo generator PYTHIA |5| (version 6.2). The user can select for a given 
generation the PDF set and the member in the set with standard PYTHIA data cards. One 
member of a PDF set is fixed at initialization time and used throughout one run. The user 
can repeat the generation as many times as needed (or feasible!) in order to compute not 
only the central values (from the member representing the "best" fit in a set), but also 
the uncertainties of various observables (e.g. total or differential cross sections, forward- 
backward asymmetries etc.). 

1.1 Consistency Checks 

To check our implementation, we have computed some cross sections using different 
PDFs. The PYTHIA default is CTEQ5L. The same PDF is implemented in PDFLIB @ 
from the CERN library. The cross sections for Drell-Yan pairs (e + e~) with PYTHIA 6.206, 
generating each time samples of 100000 events, are shown in Table [TJ 

2 Available on request from bourilkov@mailaps.org. 
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Table 1: Cross sections for Drell-Yan pairs (e + e ) with PYTHIA 6.206. The errors shown 
are the statistical errors of the Monte-Carlo generation. 



PDF set 


Comment 


xsec 


81 < M < 101 GeV 


CTEQ5L 


PYTHIA internal 


1516 ± 5 pb 


CTEQ5L 


PDFLIB 


1536 ± 5 pb 


CTEQ6 


LHAPDF 


1564 ± 5 pb 


MRST2001 


LHAPDF 


1591 ± 5 pb 


Fermi2002 


LHAPDF 


1299 ± 4 pb 


M > 1000 GeV 


CTEQ5L 


PYTHIA internal 


6.58 ± 0.02 fb 


CTEQ5L 


PDFLIB 


6.68 ± 0.02 fb 


CTEQ6 


LHAPDF 


6.76 ± 0.02 fb 


MRST2001 


LHAPDF 


7.09 ± 0.02 fb 


Fermi2002 


LHAPDF 


7.94 ± 0.03 fb 



We have performed the calculations at the Z pole and for very high masses above 1 TeV 
in order to check the PDFs in different x regions. The probed values are summarized 
in Table 13 Good agreement between the CTEQ6 and MRST2001 sets for a large Q 2 range 
is observed, taking into account that two different evolution codes are used. The results 
are close to the cross sections obtained with CTEQ5L. Even for the same PDF we observe a 
minor difference between the internal PYTHIA implementation and the PDFLIB one. Each 
PDF set has a corresponding "best fit" or preferred value for the strong coupling constant 
a s and for the number of flavors. Our interface has not been optimized yet to give the best 
values for each set, and this will happen in the next release after detailed investigation. 
The necessary code is already foreseen in the current version and the values can be set by 
the user. 

Table 2: xi and x 2 for different masses and rapidities. 



y 





2 


4 




M = 


91.2 GeV 


Xl 


0.0065 


0.0481 


0.3557 


x 2 


0.0065 


0.0009 


0.0001 




M = 


200 GeV 




0.0143 


0.1056 


0.7800 


x 2 


0.0143 


0.0019 


0.0003 


M = 


1000 GeV 


Xl 


0.0714 


0.5278 




x 2 


0.0714 


0.0097 





3 



2 PDF Uncertainties 



As discussed above, the uncertainty on an observable is estimated by repeating the cal- 
culation many times using in turn all the members of a PDF set. As we are generating 
events with PYTHIA here, we have to be careful not to overestimate the PDF uncertainty 
Namely, the observable under study will have a natural variation coming from the limited 
Monte-Carlo statistic. This complication is taken into account by: 

• when possible generating large enough samples so that the MC error is an order of 
magnitude lower than the PDF contributions 

• when this is not feasible, unfolding the MC contribution from the total variation to 
extract the "pure" PDF part; if the observed variation is equal to the expected from 
the Monte-Carlo statistic, this means that we cannot estimate the PDF uncertainty 
and can only give an upper limit - improving in such a situation entails generating 
huge samples. 

2.1 Results at the Z Pole 

We start with cross sections for Drell-Yan pairs (e + e _ ) with PYTHIA 6.206. This time a 
cut on the rapidity < 2.5 is applied, corresponding to the barrel and endcaps of a LHC 
detector. Samples of 1200000 events are generated for each member of a set. This process 
probes the (anti)quark PDFs. 

Table 3: Cross sections for Drell-Yan pairs (e + e~) with PYTHIA 6.206. The errors shown 
are the PDF uncertainties. 



PDF set 


Comment 


xsec [pb] 


PDF uncertainty % 


81 < M < 101 GeV 


CTEQ6 


LHAPDF 


1065 ± 46 


4.4 


MRST2001 


LHAPDF 


1091 ± ... 




Fermi2002 


LHAPDF 


853 ± 18 


2.2 



Again a good agreement between the CTEQ6 and MRST2001 sets is observed. If we 
take into account that in the CTEQ6 approach the estimate corresponds to ~ 90 % CL, 
then at the one standard deviation level the PDF uncertainties are small (~ 2-3 %) and 
the two methods of estimation give close values. While the CTEQ6 and Fermi2002 sets 
agree on the uncertainties, the difference in their central values requires further study. 

2.2 Higgs Production 

The cross section for gg — ► H probes the gluon PDFs, so this channel is complementary 
to the case considered above. The CTEQ6 set is used. Samples of 100000 events are 
generated for each member of the set and for each value of the Higgs mass, as shown 
in Table H 
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Table 4: Cross sections for Higgs production in gg — > H with PYTHIA 6.206. The errors 
shown are the PDF uncertainties. 



Higgs mass [GeV] 


xsec [pb] 


PDF uncertainty % 


120 


16.26 ± 0.62 


3.8 


160 


10.46 ± 0.32 


3.1 


200 


7.41 ± 0.19 


2.6 



We can observe falling PDF uncertainties for higher Higgs masses. They are always 
below 4 %, and we should take them with a grain of salt, as for this process extrapolations 
are used for very low x, and the PDF uncertainties are not known so precisely there. 

3 Determination of sin 2 ^ e ff pt (M|) at LHC 

As studied for the LHC SM workshop (see [7,8] and references therein), the huge statistics 
of lepton pairs from Z decays can open the possibility to perform a competitive measure- 
ment of the electroweak mixing angle sin 2 ^gg >t (M|) at LHC, provided that we can control 
to the needed precision the proton PDFs. This question was left open, and the tools de- 
veloped here allow to address it again. 

For this end a very precise measurement of the forward-backward asymmetry at the 
Z pole is needed. For a (x x > x 2 ) pair of partons producing the Z we have 4 combina- 
tions of up- or down-type quarks initiating the interaction: uu, uu, dd, dd. In pp collisions 
the antiquarks come always from the sea and the quarks can have valence or sea ori- 
gin. Going to higher rapidities increases the difference between Xi and x 2 and hence 
the probability that the first quark is a valence one. This allows a measurement of the 
forward-backward asymmetry even for the symmetric initial pp state. At the TEVATRON 
the proton-antiproton configuration provides a natural label for the quark or anti-quark. 

From this discussion it is clear that going to highest rapidities is helpful at LHC. A cen- 
tral detector with \r)\ < 2.5 has a much reduced sensitivity compared to the TEVATRON. 
A competitive measurement using ~ 100 fb _1 at LHC starts at Asin 2 6'| ! < | ,t (M|) = 0.0005 per 
channel, or Asin^^^M!) = 0.00025 for 2x2 channels (electrons and muons) and exper- 
iments. The best single measurement today from SLD |9[ using the left-right asymmetry 
gives Asin 2 ^ e ff pt (Mi) = 0.00026. 

In the following we outline a possible strategy to do the measurement. Events are kept 
for analysis by applying the following cuts for both leptons: 

• pseudorapidity \rf\ < 2.5 

• transverse momentum p T > 10 GeV 

which cover the barrel and endcaps of a typical LHC detector. The backgrounds for these 
final states are low and can be suppressed further by isolation cuts. The dependence of 
the asymmetry on the measured angle is given by: 

A(A FB ) = 
A(sin 2 4 pt (Ml)) 
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Not knowing the quark direction at LHC leads to values k < 1 i.e. reduced sensitiv- 
ity. Samples of ~ 110 million events (after cuts) for different values of sin^g^Ml) are 
generated and the results shown in Table 

Table 5: Asymmetry measurement at the Z pole at LHC. The coefficient k, the required 
error on A F b for a measurement with precision Asin^g^Ml) = 0.0005 in one channel 
and one experiment, the upper limit on the PDF uncertainty and the number of events 
needed are shown for three rapidity intervals. In the last row the expected events in one 
channel for one experiment with luminosity 100 fb _1 and detection efficiency 100 % in the 
acceptance region are given. 



Rapidity 


0.0 - 0.8 


0.8-1.6 


1.6-2.4 


k 


-0.021 


-0.38 


-0.59 


A(A FB ) 


0.00001 


0.00019 


0.000295 


PDF uncertainty 


< 0.00048 


< 0.00053 


< 0.000820 


Events needed 


10000 M 


27.7 M 


11.5 M 


Events in one 








ch/exp 100 fb" 1 


30.7 M 


25.0 M 


10.5 M 



We may perform two independent measurements in the rapidity intervals 0.8-1.6 
and 1.6-2.4 in order to control the experimental systematics. From the table we see that 
27.7(11.5) x 10 6 events in the two intervals are needed, which are available from one chan- 
nel in one experiment for ~ 110 fb -1 if the detection efficiency in the acceptance region 
is 100 % (for lower efficiency the luminosity has to be rescaled accordingly). Clearly the 
statistical error is not a problem, and we have two independent measurements for each 
channel. From a big run with CTEQ6 - 40 x 15 x 10 6 events, we have determined the up- 
per limits at 90 % CL of the PDF uncertainties for different rapidity intervals, as shown 
in Table 13 Even with this statistics we have not reached a systematic show-stopper, so 
the question remains open: it is possible that the PDF uncertainty will not be the limiting 
factor, but runs with huge samples are needed to determine it reliably. 

A Appendix 

The interface is designed along the lines of the existing interface from PYTHIA to PDFLIB 
(see e.g. CERNLIB). 

1. If the user is not interested in LHAPDF, standard PYTHIA can be used OR the file 
pythia6206.f 

from the repository. This file has been modified as follows: two PYTHIA rou- 
tines (PYINIT, PYPDFU) will perform additional actions if MSTP(52)=3. And two 
dummy routines LHALAUNCH and LHAPARD are added close to the end of the 
file. So if the user does not set MSTP(52)=3, nothing changes. And if the user just 
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happens to set MSTP(52)=3, the dummy routines will be called, they will issue an 
error message and stop the execution. For using LHAPDF please see 2). 

2. If the user is interested in LHAPDF, the file 
pythia62061ha.f 

from the repository should be used. Compared to pythia6206.f the only change is 
that the two dummy routines LHALAUNCH and LHAPARD have been taken out. 
Now the user has to link in addition the library 

libLHAPDF.a 

which is built using the file Makefile (just give the command make at the UNIX 
prompt). 

Some files from LHAPDF have been modified for technical reasons and to facilitate 
ease of use and transparency for the user. In the original LHAPDF code there are two 
evolution codes: EVLCTEQ used with the CTEQ PDFs, and QCDNUM for other PDFs. 
The user is expected to build the library corresponding to ONE of them and link it. So the 
choice of PDF cannot be done at runtime and to use the second option one has to rebuild 
the library and relink. 

By modifying the code, I have simplified the use: the library is built only ONCE, and 
contains the two evolution codes. To use LHAPDF one sets MSTP(52)=3. The user can 
select the PDF set and the member of the set at runtime with MSTP(51): 

• CTEQ6(file cteq6 . LHpdf ): 

MSTP(51) = 1000-1040 (1000 - best fit, 1001-1040 - PDF members) 

• MRST2001 (f ile MRST2 1 . LHpdf ): 

MSTP(51) = 2000-2002 (2000 - best fit, 2001-2002 - PDF members) 

• MRST98 (f ile MRST98 . LHpdf ): 

MSTP(51) = 2100-2103 (2100 - best fit, 2101-2103 - PDF members) 

• Fermi2002 (f ile Fermi2 02_1 . LHpdf ): 

MSTP(51) = 3000-3100 (3000 - best fit, 3001-3100 - PDF members) 

• Fermi2002 (f ile Fermi2 02_1 . LHpdf ): 

MSTP(51) = 4000-5000 (4000 - best fit, 4001-5000 - PDF members). 

The PDF sets are stored for convenience in the subdir PDFsets (can be downloaded 
from the LHAPDF page). The routine LHALAUNCH is hardcoded to fetch them from 
this subdir - if the user decides to store them elsewhere, the code should be modified to 
reflect the new location. 

Each PDF set has a corresponding "best fit" or preferred value for the strong coupling 
constant a s and for the number of flavors. The interface has not been optimized yet to 
give the best values for each set, and this will happen in the next release after detailed 
investigation. The necessary code is already foreseen in the current version and the values 
are set in LHALAUNCH. 

ENJOY! 
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